Impact of Numa Effects on High-speed Networking with Multi-opteron Machines
نویسندگان
چکیده
The ever-growing level of parallelism within the multi-core and multi-processor nodes in clusters leads to the generalization of distributed memory banks and busses with nonuniform access costs. These NUMA effects have been mostly studied in the context of threads scheduling and are known to have an influence on high-performance networking in clusters. We present an evaluation of their impact on communication performance in multi-OPTERON machines. NUMA effects exhibit a strong and asymmetric impact on highbandwidth communications while the impact on latency remains low. We then describe the implementation of an automatic NUMA-aware placement strategy which achieves as good communication performance as a careful manual placement, and thus ensures performance portability by gathering hardware topology information and placing communicating tasks accordingly.
منابع مشابه
Measuring NUMA effects with the STREAM benchmark
Modern high-end machines feature multiple processor packages, each of which contains multiple independent cores and integrated memory controllers connected directly to dedicated physical RAM. These packages are connected via a shared bus, creating a system with a heterogeneous memory hierarchy. Since this shared bus has less bandwidth than the sum of the links to memory, aggregate memory bandwi...
متن کاملMainstream NUMA and the TCP/IP stack
Historically, approaches to achieving scalable networking focused on reducing the host processing requirements associated with TCP/IP interrupts. In the many‐core era, approaches such as interrupt moderation, jumbo frames, and the TCP Offload Engine prove inadequate because the modest increases in the processing speeds forecast for host computers are not keeping pace with improvements in networ...
متن کاملVMware ESX Server 2 NUMA Support
ESX Server 2 provides memory access optimization for both Intel processors and AMD Opteron processors in server architectures that support NUMA (nonuniform memory access). This white paper provides background on NUMA technologies and a detailed description of the sophisticated NUMA optimizations available in ESX Server 2. The document contains the following sections: • Introduction • What is NU...
متن کاملExploring Thread and Memory Placement on NUMA Architectures: Solaris and Linux, UltraSPARC/FirePlane and Opteron/HyperTransport
Modern shared memory multiprocessor systems commonly have non-uniform memory access (NUMA) with asymmetric memory bandwidth and latency characteristics. Operating systems now provide application programmer interfaces allowing the user to perform specific thread and memory placement. To date, however, there have been relatively few detailed assessments of the importance of memory/thread placemen...
متن کاملContributions au contrôle de l'affinité mémoire sur architectures multicoeurs et hiérarchiques. (Contributions on Memory Affinity Management for Hierarchical Shared Memory Multi-core Platforms)
Multi-core platforms with non-uniform memory access (NUMA) design are now a common resource in High Performance Computing. In such platforms, the shared memory is organized in an hierarchical memory subsystem in which the shared memory is physically distributed into several memory banks. Additionally, these platforms feature several levels of cache memories. Because of such hierarchy, memory ac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007